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Abstract. We give derivations of some basic results for the Bernstein approxi- 
mation in n variables that are useful in investigating copulas. It is shown that 
Bernstein approximations of copulas are again copulas. We exhibit a stochastic 
interpretation for the Bernstein approximation of a copula. 



1. Introduction 



Bernstein approximations of 2-copulas were introduced and studied in [3] and [l]. 
We assume the reader is familiar with copulas; see, for example, [7] or [6]. 

This note was written to clarify for myself and my colleagues certain properties of 
Bernstein approximations that are useful in investigating copulas. We derive some of 
the basic properties of the Bernstein approximation for functions of n variables and 
then show that the Bernstein approximation of a copula is again a copula. Our most 
significant result is a stochastic interpretation of the Bernstein approximation of a 
copula. This interpretation was communicated to us by J. H. B. Kemperman in [2] 
for 2-copulas and we are not aware of its publication elsewhere. 

The encouragement and contributions of our colleagues P. Mikusihski and X. Li to 
this note were crucial. 



It is our convention that / = [0, 1]. 
Definition 1. The m-th degree Bernstein polynom iai bj iY, '. I — y M. is given by 



/ = 0, 1, . . . , m. We extend this to B"^ :/"-)■ M by taking / to be a multi-index, 
/ = (/'i, . . . , /„), where each ik G {0, 1, . . . , m} and setting 

where x = (xi, . . . , x„) 6 /". 

Notice that {B"^} is a partition of unity over /". 

Here is the intuition behind the Bernstein polynomial: Consider the act of tossing 
a coin m times with probability of heads on each toss being x. This scenario can be 
represented by a random vector X : Q ->■ {0, l}*" with the property that if X{lj) = 
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(xi, . . . ,Xm), then P(x, = 1) = x. We then introduce the random variable Y defined 
by 

= Y.{x, : X, = 1}, 

in other words, the number of heads that were tossed. This is familiarly described as 
a binomially distributed random variable with parameters m and x. It is easily shown 
that 

£(y) = mx and W ariy) = E{iy - Eiy)f) = mx{\ - x). 

Notice that 

b,,^{x) = P{Y = /). 
Definition 2. If f : /" M, we define the Bernstein approximation to f to be 

where ; ranges over all multi-indices / = (/'i in) such that each 6 {0, 1 m}, 

and by j- we mean the vector i-^, ■ ■ ■ ,-^)- 



It can be shown by induction that 

f n n 1 

Jin {m+iy 
where A" is Lebesgue measure on E". 



3. The Weierstrass approximation theorem via Bernstein polynomials 

Tlieorem 1. Iff : /" — > M is continuous, tlien 23m(f) — > f uniformly on /" as m —>■ oo. 

Proof Choose e > 0. Since f is uniformly continuous on /", there exists 5 > with 

the property that if x = (xi Xn), y = (yi yn), and |x/ — y,| < 6 for all /, 

then |f(x) - f(y)| < e. In what follows, it is convenient to define d by d{x,y) = 

max{|xi -yil |x„ -yn|}. 

Set fm = 'Bm(f)- We suppose that m is so large that < e; we shall show that 
this makes \ fm — f \ "small." Choose x = (xi, . . . , Xn) 6 /". Then 

iax)-f(x)|< J2 \f{i^)-f{x)\B'Ux)+ \f{i^)-f{x)\B'Ux) 

where / = (/'i /„), a multi-index. We see that 

E K(^)-f(x)|e,':„,(x)<e 

d(^,x)<5 

by uniform continuity of f. To find a bound for the other term, we first introduce 
independent, binomially distributed random variables Xi, . . . ,Xn with parameters m 
and X. By Tchebycheff's inequality, for each j = 0, 1, . . . , m we have 

m(52 - 4m5^ 
Let /W = max f . Then 

E |f(^)-f(x)|e,';^(x) < 2/w E 







>6^ < 


( 


- Xj 

m 
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We see that 



<Ep 



(-■ 






V m 


"ml 







Xj 




( 


— - x^j 
m 







Thus \ fmix) - f(x)| < e + 2Mne, and we are done. 



□ 



4. Derivatives of Bernstein approximations 
4.1. Derivatives of Bernstein polynomials. \i f : 1" ^ M., set 

(1) f„, = 'B"^{f) = E ^ (^) = E ^ (^) ® • • • ® 

where / = (/'i, . . . , /„) and //( = 0, 1, . . . , m and the symbolism g ^ h \s interpreted to 
mean (g /7)(u, i/) = g(u) h{v). We want to compute partial derivatives of fm- In 
particular we want compute the mixed partial g^^7.Qx„ which we denote dfm- In the 
case where fm is a cumulative probability distribution function, dfm is the associated 
probability density. 

It is convenient at this point to introduce another notation. Let g : /4 -> M where 
A C M". Suppose V G E" such that An{A- v) ^ We then define a function 
AyQ : Ar]{A - v) ^ Rby A^gip) = g{p+ v) - g{p). That is, Ayg{p) is the variation 

of g starting at p in the direction v. Next, let ei e„ be the standard orthonormal 

basis for K", that is, d = (1, 0), 62 = (0, 1, 0), etc. For f :/"-)■ IR and 

/ = (/'i, . . . , ;„), where /;< = 0, 1, . . . , m — 1, we define 

We can think of A"^f as the variation of f over the n-dimensional square [■^, x 
■■■xf^,i^). 

Returning to the problem of derivatives, one calculates 

{-mbo,m-i. ' = 0, 

i = m. 

If we set 6_i,m-i = bm,m-i = 0, then we may reduce this to 

(2) b' m = m {bi-i,m-i - bi,m-i), < i < m. 
If one then considers the case where n = 1 so that 

m 

/=0 

where / is now an integer, then one easily calculates 

m-l 

(3) '^- = '"E(^^..^(^)) 

j=0 
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We then pass to the general n-dimensional case where fm has the form given in ([T]) 
and by repeatedly invoking the 1-dimensional case and Equation ([3|), we obtain 



Dj„.m-1 



where j = (ji< ■ ■ ■ Jn) and Jk = 0,1, . . . , m - 1. 

It is well-known that the Bernstein approximation of a copula is again a copula 
(see, for example, [3] and [4]), but this is also an immediate consequence of this last 
formula: 

Theorem 2. The Bernstein approximation of an n-copula is again an n-copula. 

Proof. Let Cm = 25[^(C) where C is an n-copula. The boundary conditions for a 
copula are easily checked. The only questionable condition is whether or not Cm is 
n-increasing. But this follows from the fact that the terms of 



dCm = m" ^ (Aj:„C) bj„m-l 



m-1 



are nonnegative. □ 

4.2. Some identities and estimates. In what follows, we assume that x £ I, m = 

1,2,3 and / = 0, 1 m. 

The following is straightforward to establish by induction over /: 



Proposition 1. 



m 



x) bi,m{x) = X {1 - x) (bi- 



iW). 



Proposition 2. 



E 

/=0 



m 



\bi-l.m-lix) - bi,m-lix)\ 



1 

m ' 



Proof We assume that X is a binomially distributed random variable with parameters 
X and m and make use of Proposition [T] 

2 



E 

/=0 



X 

n 



\bi-l,m-l{x) - bi,m-l{x)\ 



x(l-x) 



/=0 



— r — Var{X) 

x(l - x) 



1 

m ' 



□ 



Proposition 3. 



E 

/=0 



m 



bi,m{x) = 2x(l -X) m_i(x) 



wliere /'o = [mx\, tlie greatest integer less tlian or equal to mx. 

Proof. Let us assume x is irrational, < x < 1. There is a unique nonnegative 
integer, namely /q = [mxj, such that 

/o + 1 



^<x< 
m 



m 
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We then perform a calculation in which we invoke Proposition [T] 



;=0 



/=0 



m 



/=/o + l 



m 



/=/o + l / 
= 2x(l -x)/3,o,m_i(x). 



v/=0 



We then obtain the proof for general x by invoking continuity. 



□ 



A proof of the following has been shown to us informally by our colleague Xin Li, 
but it can also be found on p. 15 of [5]. 



Proposition 4. 



E 



X 

m 



biM = 0[^ 



Proposition 5. For every 6 > and x e I, 

E ^'■-w = o [i 

|x— // m|>5 



Proof. From [H p. 304], we find that for each 6 > and s = 1,2 there exists 

C = C((5, s) such that 

^ /),,„(x) < Cm-^ 

|x— // m|>5 

for m = 1, 2, . . . and x 6 /. □ 
4.3. Convergence of first derivatives of Bernstein approximations. 

Theorem 3. Suppose that f : 1" ^ R is a bounded function. Ttien for all x = 
(xi Xn) 6 (0, 1)" at which f is differentiable and for k = 1 n, we have 

d'B"Jf),_^, df 



lim „ 

m-J-oo OXif 



Proof Let us set 



f^ = ^"^{f) = Y,f{j^) 

where / = (/'i, . . . , in), each G {0, 1, . . . , m], and 

mm m 

E = EE-E. 



/ /i =0/2=0 /„=0 



We prove the proposition for the case k = 1. 
First, we have 



(4) 



dxi 



m 



E ^^m) (bii-l.m-l - (8) bi. 
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Second, by the differentiability of f at x, we have 



(5) ^(7^;) = fW + Ea^W 

where 



m 



m 



X 

m 



Xk 



and r}{s) ^ as s ^ in E". Next, making use of ([5|), it can be shown there is 
a constant M, dependent on x and n but independent of m, such that |??(s)| < M. 
This can be done by considering the case where s is "close" to and the case where 
s is some fixed distance from 0. 

Next, substituting from dHI) into dH), we see that (9fm/9xi)(x) becomes 



df 



m 



X 

m 



Now E,i(^i-l,m-l(Xl) - = /3_i,m_i(Xi) - 6m,m-l(Xl) = 0. ThuS 

^ f(x)(£),i_i,m_i(Xi) - £>,i,m-l(Xi)) /),2,m(X2) • • • £),„,m(x„) = 0, 
/ 

and for every /c > 1 we have 

X] ^^^^ ( m ~ ^'^j ('''i-l.'^-l^-^l) ~ ^i.m-lCxi)) /),2,m(X2) • • • £',„,m(>^n) 

= 0. 

On the other hand, it is easily seen that 



m 



f — - Xi j (6,1-1, m-i(xi) - 
'1 



and we know that J2i^ bj^^mixk) = 1 for /c > 1, therefore 



df 



m 



Thus we can write 
where 



OXi OXi 



m 



7i-l,m-l 



(Xl) 



-'/'i ,m— 1 



(^l)) /?/2.m(^2) • - ■ bi^^miXn)- 
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Our task now reduces to showing Sm 0. Choose e > 0. There exists 5 > such 
that if |s| < 6, then \ri{s)\ < e. Let us set 5, = \{i/m) — x\ and then break Sm into 
two pieces, Sm = S> + S^, where 



Sm = J2 s^ = Yl 



5i<S 



We first consider S<. By Proposition [T] 



Si>5 



Xl 



.(xi). 



Then 



xi(l-xi)'^'^''"^ 

'1 



< 



5,<(5 

me 



Xi(l - Xl) 



5,<5 



m 



Xl 



< 



me 



Xl(l - Xl) 



EE 

/ k=l 



k 




k 


Xk 




Xl 


m 




m 



\,mixi) fa/2,m(x2) " ' • fa,„.m(x„) 
J;i,m(xi) • ■ • /3,„,m(Xn). 



Now X],i('i-"ixi)^6/i ,m(xi) is the variance of a binomially distributed random variable, 
so 



E 



m 



Xl £i,i,m(Xl) 



Xl(l - Xl) 



m 



On the other hand, for k ^ 1, we have by Proposition IH 



EE 

'1 'k 
It follows that 



k 




ik 


Xl 




Xk 


m 




m 



Now we turn to S^. We know the following: 

(1) \v\ < M. 

(2) 5, < v^. 

(3) {/ : |^-x|>5}cULi{' : |^-^'<|>!}- 
Using these facts plus Proposition [5] we obtain 

n 

<mMs^ E^'*'"(^'') 
|^->^*|>^ ''^^ 
= mM^ n o(^) = 0(1). 
We therefore conclude that 

|Sm| <eO(l) + o(l)^0 

as m 00. 



□ 
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5. A probabilistic interpretation of the Bernstein approximation of a copula 

It is our goal here to construct random variables such that the Bernstein approx- 
imation is the cumulative distribution function of these new random variables. This 
probabilistic interpretation was brought to our attention by J. H. B. Kemperman in 

m 

Let C be an n-copula. Suppose it is the cumulative distribution function of the or- 
dered n-tuple of random variables (Xi X„) where each X, is uniformly distributed 

over /. Let m be a "large" natural number and Cm be the m x ■ ■ ■ x m Bernstein 
approximation of C; that is 

m 

C^(xi....,X„)= J2 ^im' 

'1 in=0 

where xi Xn G /. 

Next, for / = 1, . . . , n and j = 1, . . . , m, we let Xj be independent random variables 
that are uniformly distributed over / and have the property that (Xi, . . . , Xn) and Xj 
are independent for all ij. If it is helpful, we may regard these random variables as 
being defined over the space /" x Z'"" and having probability measure P where P has 
the form ij,c x A'"" and where it is understood that iJ,c is the probability measure 
induced on /" by C and A"'" is Lebesgue measure on /"'". 

Now for each /, let xj^^ xj"'^ be the order statistics for Xf X,*". That is, 

whenever 

Xf < ••• < Xf , 

then X,^**^ = Xf. 

It may be helpful to notice that by the independence and uniform distribution of 
our original random variables, for all iJ, r, s and all x G / we have 

p(x; = X,") = if x; 7^ X,^ 

P(X,'^' = x) = 0, 

p(xP = x''^') = if xP ^ X^/\ 

Next, if k is the multi-index {ki, . . . , kn) where kr = 0,1, m — 1, then we define 

fki ki + l \ (kn kn + l \ {kr K + 1 \ 

lmk = [—• X • • • X — , = <^ — < Xr < : r = l n> . 

\m m J \m m J [ m m J 

This is a slight abuse of notation since Xr "lives" in /" x rather than /", however the 
reader should easily make whatever mental adjustments are necessary in the arguments 
that follow. We then take Xk to be the characteristic function of /^^^ with the 
understanding that the domain of Xk is /". 
Finally we define 

m-l 
ki k„^0 

where r = 1, . . . , n and k = {ki, . . . , /c„). We then have the following: 
Theorem 4. Cm(xi x„) = P{Yi < xi V), < Xn) where (xi x„) £ /". 



■■■.7?;) ^i,m(xi)---6,„,m(x„) 
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Proof. For x G / and / = 1, . . . , n, we form a random variable X*{x) by setting 
X*{x) = the number of X/ XI" less than x. 

We see that the following are true: 

(1) X*{x) takes values in {0, 1 m}. 

(2) X*(x) and X*{y) are independent for / 7^ j. 

(3) P(x;(x) = ^) = (-) x" (1 - xr-' = b,M- 

We also see that 

{x;(x) = o} '-i^'- {x<xW}. 

(6) {X*{x) = k}''-^' {Xj'^ <x<xl'+'^} for/c = l. 



m — 1, 



From ([6]) we can easily deduce 

(7) {X;(x) > k] ''-=■ {Xj'^ < X} 

for / = 1 n and /c = 0, 1 m. 

We now consider the Bernstein approximation to C. Let xi x,, £ /. Then 



X„)= ^ C(i ^) bl,,m{xi) ■ ■ ■ bl„.m{Xn) 



il in = l 

il in 



/(, - 1 



< X, < - : r = 1, 
m m 



Since 



we have 



/l /n — 1 /tl — 1 /(n — 1 

P(X;(x,) = /V : r=l n) 

m ir m m 

EE = EE. 



Cm(Xi X„) 



^ E E-E 

ki kn = l ii=ki in = ^n 



kr 1 , , kf 

— <Xr <— : r = l, 

m m 



^ E 

kl kn^l 



^ E 

kl k„^l 



P{X;{xr) = ir : r = l n) 



1_ , , kf 
<Xr<— : r = 1, 



P(X;(xr) > /c, : r = 1 n) 

r = 1 n 



< X, < 
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P (X/"') < Xr 



where the last step follows from ([7]). 

We now consider the cumulative distribution function of (Vi, 

P{Yr <Xr : r = 1 n) 



m 

E 

/fl kn = l 

m 

E 

ki k,-l 



P{Yr<Xr, (Xi Xn) 6 C,, : r=l n) 



P (X/"') < Xr 
kr-l 



<Xr< 



kr 



m m 

by the definition of Yr and the independence of the random variables. We see from 
this that 

Crr,{xi X„) = P(Vi < Xi Y„<X„). □ 
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